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Abstract 

The problem of distributed or decentralized detection and 
estimation in application such as wireless sensor networks 
has often been considered in the framework of parametric 
models, in which strong assumptions are made aout a 
statistical description of nature. So the distributed learning 
method is borrowed to solve the localization problem. It 
assumes that a network with a number of beacon nodes that 
have perfect knowledge of its own coordinates and utilizes 
their knowledge as training data to perform the above 
classification. In this thesis, three approaches for distributed 
learning based on the different features that is used to 
determine the class of each node have been proposed, 
namely, the hop-count (HC) method, the density-aware 
hop-count length (DHL) method, and the distance vector 
(DV) method. These methods are compared under different 
system parameters and also compared with the triangulation 
method that is often employed in the literature. The 
simulation results show that the localization methods based 
on the distributed learning is more accurate and effectual. 
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Introduction 

Wireless sensor networks (WSNs) typically consist of a 
large number of sensors deployed over a wide area to 
retrieve information from the environment. Due to the 
large scale deployment and the limitations in size and 
cost, sensors' communications are often strictly 
constrained in terms of energy and bandwidth 
resources. It is important to utilize the resources 
efficiently in terms of transmitting only the data that is 
relevant in achieving the system task, instead of 
transmitting raw observations. 

Many sensor network applications are based on the 
fundamental task of decentralized detection or 
estimation where optimal local quantization or 
decision schemes are derived under constraints on the 
communication channel. However, most of these 



works assume parametric models that rely on perfect 
knowledge of the statistical observation models, these 
requirements are difficult to obtain in practice, 
especially as the range of applications or as the target 
environment widely varies. In localization applications, 
the environment may vastly differ from one 
application to another and the prior knowledge or 
accurate modelling of the environment may not be 
available and the distributed inference techniques 
derived under specific parametric models may not be 
robust or may even be inapplicable. 

Due to these reasons, several recent works have taken 
the nonparametric approach in studying the 
decentralized detection and estimation problems in 
sensor networks. In particular, the series of work taken 
on the learning-theoretic approach have received 
considerably attention and will be the focus of this 
thesis. In the literature on statistical learning, a set of 
training data is assumed available at the decision 
center, where computation is made by comparing new 
observations with that set of data. But, in WSN 
applications, training data may not be available at a 
single fusion center but instead distributed among 
sensors. The task in distributed learning is then to 
communicate useful information regarding the 
observations as well as the training data to perform the 
distributed inference task under communication 
constrains. 

In the distributed learning scheme proposed in, 
training data is distributed among different locations 
and, if each time a user, or fusion center, desires an 
estimation or decision, it will first transmit its local 
observation to sensors that contain the training data. 
Then theses sensors will compute a binary response 
based on the received observation and their local 
training data, and pass it back to the user. 

In this design, a WSN is considered that consists of a 
set of reference nodes that has knowledge of its own 
location as well as man oblivious sensors that can only 
estimate its location based on locally obtained 
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information. More specially, the sensor field is divided 
into multiple partitions and the distributed learning 
algorithm is used to determine which partition the 
target sensor falls into. The location of the reference 
nodes serve as training data is used to compute this 
information. 

Localization in WSNs have been studied extensively in 
the past, and most work in the literature is focused on 
range-based schemes that require a form of estimate on 
the distance between two or more sensors. Methods to 
obtain the distance estimate are such as RSSI, TOA, 
TDOA and AOA, in these methods, every sensor must 
be equipped with associated ranging hardware which 
may not be achievable in practice. Hence in this work, 
we consider the range-free alternative where 
distributed learning for localization is performed with 
only local hop-count information between the target 
sensor and the reference nodes. 

Two approaches have been taken into account: the 
approach where hop-count information is used 
directly as input to the learning algorithm and the 
approach where hop-count information is first 
converted to distance estimates before being used. 

Distributed Learing in Wireless Sensor 
Networks 

Consider a network of n sensors denoted by SI, 
S2, ...,Sn, with direct transmission links to the fusin 
center in Fig. 1. Suppose that each sensor, say Si, 

contains a training data (xi, yi), where y ' is the 

true value of the event and ' is the observation 
corresponding to the event. In the literature on 
statistical learning, xi's are known as features, 
observations, or inputs, and yi's are known as labels or 
outputs. Here the fusion center represents the user that 
is to perform the detection or estimation task. 
Specifically, the fusion center first makes an 
observation X and broadcasts it to all sensors. Upon 
reception of the observation from the fusion center, 
each sensor will choose whether or not to respond to 
the fusion center's request for information. Assume 
that each sensor may transmit at most 1 bit to the 
fusion center. If it choose to respond, a local binary 

decision, ^ m e ^ ^ , based on the available 
information, (X, xi, yi), will be computed and sent to 
the fusion center; otherwise, it will abstain from 
transmitting. At the fusion center, the local decisions 
are combined to compute the final 



decision g n (x,fc}"J 
{xi.yi} 

© 




FIG.l SYSTEM MODEL 

To be consistent with the remainder of this thesis, let 
us consider a binary classification problem where the 

label is binary, Ys {~ l > 1 }. 
Si is given by 



The local decision at sensor 



ifx^B RN (X) 



[abstain otherwise 



(1) 



wheie BmM = ^R d :\\x-4<r n } 



The final decision 
at the fusion center is based on the majority vote rule: 



g n (X) = 



1 if 



ely ^ ni 



>0 



1 otherwise 



(2) 



where Iv is the set of sensors that do not abstain from 

= \i e il,..., n\: 8„ t ■£ abstain \ 
transmission, v 1 1 ' ' > > . 

Suppose that Y is the true event or label corresponding 

to the observation X and let e (sn( x )^{ x i^i}i=i) be 
the loss of the decision rule given the set of training 

data " ' • y ' ' i=x . For the binary classification problem 
considered above, the loss is defined as 
Kg n (X\Y) = l{ g „ ( x)*r}_ Furthermore/ let us define l* as 

the error probability obtained with the optimal 
Bayesian detection where the joint statistics of (X,Y) are 
assumed to be perfectly known. 

Definition 1: Let L ^ = ' E ^ ( ^» (X) ' y) ]. Then, k»'»=i is 

* 

said to be universally consistent if " ~* for all joint 
distributions of (X,Y). 

It has been shown in that the above decision rules are 
sufficient to achieve universal consistency under 
certain conditions provided b the Stone's theorem, 
which is stated below. 



Theorem 1. If r "^° and r " ~* 00 as n 



then 
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E[L N ] -> L for all . Qint distributions of ( X ,Y). 

The results on universal consistency provided in [8] 
are proven for the case without errors on the channels. 
However, in practical systems, noise always exists in 
the transmission channels, so, we are going to consider 
this problem. 

Assume that there is error existing on the transmission 
channel from the sensor i to the fusion center. In 
particular, suppose that, the fusion center will receive 

s = 

the true local decision of sensor Si if » and the 

£ = 1 

wrong decision if ' . Thus the new received 

decision from sensor Si is ni ~ ^ ' "' . The decision 
made at the fusion center then becomes 



g n (X) = 



Therefore, it has 



1 if ^ lv5ni >0 



1 otherwise 



(3) 



Iy ni ^-ti^Iy ncorrect ^ ni £-ti<Ely nerror ^ ni 



8 ni ~ lie 



Si'e/t/ ^ ni ^S/e 



Iy nerror ^ ni 



(4) 



and also, as N goes to infinity, it has 

Sie/ V & n i ^ ^ie/y 8 ni Sie/y 8 ni _ 



(5) 



where Pe is the error probability over the network. 
From the above, we guaranteed that the system with 
transmission errors yields the same decisions as the 



>2P„ 



<-2P. 



errorless case if 



or if 



Distributed Localization Methords 

Distributed Localiz Ation Using Hop-Counts (HQ 

Let's consider a network of n nodes SI, . . .,Sn deployed 
over a square area [0,D]x[0,D]( D>0). Among the n 
sensors, k of them serve as beacon nodes which have 
knowledge of their exact geographical coordinates. 

Let (x(Si),y(Si)) be the true coordinates of node Si. 
Based on the sensors location in the x-dimension, and 



the sensors are divided into M-l disjoint classes 

Cx,l Cx,M-l where S <" e C *.' if x(S t )>tDIM ^ 

position of sensor Si is on the right side of the line 
x=tD/M, for t=l,2,...,M-l. So, it defines: 



f 1 ifS^C xt 
a.=\ ' =sign(x(S i )-tD/M) 

1-1 ifS^Cl 



(6) 



c° c 

where is the complement set of x '' . Similarly, 
we can also partition the sensors into M-l disjoint 
classes Cy,l,...,Cy,M-l based on their location in the 

y-dimension, where ^' G ^ v ' if ,)- tD I M _ So, 



a 



y _ 



[1 if Si e C yJ 
1-1 if S t * CI, 



= sign(y{S i )-tDIM) 



(7) 

C c C 
where y,t is complement set of y '' . The class 

C 

y '' contains nodes that lie above the horizontal line 
y=tD/M. 

By means of the techniques derived from the 
distributed learning literature, the goal is to determine 
which classes the target sensor belongs to. This 
effectively localizes the sensor within D/M precision 
on each dimension. The localization algorithm with 
distributed learning algorithm is described below: 

1) Initialization Phase: Upon deployment of sensors, 
each beacon node will first broadcast a HELLO 
message to all sensors in the network. The HELLO 
message contains a "hop-count" entry that will be 
increased by 1 and each time an additional hop is 
taken. If a redundant HELLO message is received at a 
sensor, the one with the larger hop-count is discarded. 
With this initialization, each node, or node Si will be 
able to obtain its hop-count vector hi. The beacon 
nodes will send this HELLO message periodically to 
be adaptive to dynamically varying environments. 

2) Local Localization Phase: When a non-beacon node 
desires an estimate of its location, it will first broadcast 
an INFO message containing both its hop-count vector 
and its ID to all beacon nodes. Based on the hop-count 
vector, each beacon will then compute a local binary 
classification on each of the 2M-2 classes, namely, 
Cx,l,...,Cx,M-l and Cy,l,...,Cy,M-l. Specially, 
corresponding to the INFO message sent by sensor Si, 
beacon node Sj will generate a response for class Cx,t 
as: 
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a. 



abstain 



\x(Sj)-tD/M\ 



otherwise 



(8) 



where ^ is the smallest integer such that^" a , hi,j is 
the hop-count from sensor Si to beacon Sj, and 
similarly for the classes on the y-dimension. 

3) Final Decision Phase: Upon reception of the local 
decision vectors, sensor Si can then use the majority 
vote fusion rule, similar to that in equation (2), to 
perform the final decision on each class. Specifically, 
the final decision on class Cx,t is made by taking 



1 



if Tj.ry s;>o 



- 1 otherwise 



(9) 



Where Iv is the set of beacon nodes that do not abstain 



from transmission. Here, 
belongs to the class Cx,t and 



= 1 



indicates that Si 



= -1 

"•' indicates that it 



belongs to the compliment class Similarly, the 

decision on the class Cy,t is 



[1 ifT jeIV S^>0 
1 otherwise 



(10) 



Given the decision vector ' ISU'—'Si.m-ij ^ tne 
localization on the x-dimension can be performed by 
traversing the binary decision tree, as shown in Fig.2. 




B74 D/2 

FIG.2 DECISION TREE 

Where M=16. Specifically, starting from the root class 
Cx,t, where t=M/2, sensor Si will go on to decide on 
class Cx,tl (where tl denotes the left child of the current 
class in the decision tree) if the decision gi,t=-l. 
Otherwise, it will move on to the class Cx,tr (where tr 
denotes the right child of the current class in the 
decision tree) if the decision gi,t=l. The pseudo-code is 
given as follows (the x-dimension localization): 



® t=M/2 (start at root of the tree Cx,M/2); 
© IF (this sensor predicts S not in class Cx,t) 

a) IF (Cx,t is a leaf node) THEN RETURN 
x(S) = (t-l/2)D/M 

b) ELSE Move to leftchild Cx,tl and set t=tl 
©ELSE 

a) IF (Cx,t is a leaf node) THEN RETURN 
x(S) = (t + l/2)D/M 

b) ELSE Move to rightchild Cx,tr and set t=tr 
© GOTO Step 2 

A similar procedure is also performed to determine the 
location of each sensor on the y-dimension. The 
estimated location coordinates for the non-beacon 

node Si is thus (x(5;) ' . 

Distributed Localization Using Estimated 
Hop-Distances 

The localization which performed by utilizing the 
hop-count information is simple and practical, 
however, the disadvantage is that, as the transmission 
radius increases or as the density of the network 
increases, the hop-count cannot accurately reflect the 
actual distance between nodes and, thus, affecting the 
accuracy of the location estimate. Although many 
ranging techniques such as RSSI, TOA, TDOA and 
AOA can provide explicit estimates of inter-node 
distances which may allow us to overcome the 
disadvantages of using only hop-count information, 
the additional hardware required may not be available, 
especially for heterogeneous or generic sensor 
networks. 

Recently, several authors proposed methods to 
estimate hop-distances based only on hop-count 
parameters, such as the density-aware hop-count 
localization (DHL) method and distance-vector (DV) 
method. Usually, both of DHL method and DV 
method use triangulation to estimate the distance 
between two nodes in the localization process. Here, 
instead, the distance estimates with obtained DHL and 
DV as inputs to the distributed learning algorithm will 
be utilized. 

Suppose that d(i, j) is the estimated distance from the 
non-beacon node Si to beacon node Sj obtained by 
either the DHL or the DV method. The response at 
beacon node Sj is as follows as 
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if d(i,j)<\x(Sj)-tD/M\ 



*H 1 

[abstain otherwise 



(11) 



on the x-dimension, and similarly as 

, y _\a] t if d(i,j)<\y(Sj)-tD/M\ 



SI 







[abstain otherwise 



(12) 



on the y-dimension. The final decision is again given 
by equation (9) and (10). 

Simulation Results 

In the experiment, assume that n=100 sensors are 
randomly deployed in a square region of [0,D]x[0,D], 
where D is set to be 30 m. And the transmission range 
is set to be r=7,10 m. In Fig. 3, it compares the 
performance of the HC method, DV method, and DHL 
method,, it shows that, for small r (r=7), the HC 
method more accurately reflects the distance to the 
destination. Also, the increase in hop-counts provides 
a more unique signature for nodes that are at different 
locations. When the number of beacons is small, the 
DV and DHL methods cannot accurately estimated the 
hop-count length and, thus the HC method performs 
better as well. However, as the number of beacon 
nodes increases for r=10, the DV and DHL methods 
outperform the HC method since the hop-count no 
longer accurately reflects the traversed distance and 
also the decrease in hop-count number makes the 
hop-count vector less identifiable for sensors at 
different locations as well. 

The advantage of DV and DHL is even more clearer as 
the node density increase as shown in Fig.4. Here, the 
methods for r=8 are compared in a fixed area with 
increasing node density. In particular, the number of 
sensors is increased from 100 to 300 while the number 
of beacon nodes is fixed at 30; It shows that even 
though the mean square error (MSE) of all schemes 
increases with the node density, the loss with 
hop-distance estimation is smaller and thus it 
eventually outperforms the hop-count method. This is 
because, for networks with high node density, both 
DHL and DV methods will more accurately reflect the 
actual distance between nodes than the hop-count 
method. 

DHL and DV Methods Using Triangulation 

The DHL and DV localization methods utilize the 
distance estimates obtained from the above methods to 



perform triangulation. Assume that there are k beacon 
nodes in the network, denoted by SI, S2, SK, at 
locations (xl, yl), (x2, y2),..., (xk, yk). Let (x0,y0) be the 
true location of the non-beacon node. Suppose that the 
non-beacon node obtains the estimated hop-distance to 
HSE vs. Beacons with i=7 
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FIG.3 COMPARISION ON THE HC, DV, AND DHL METHOD 
MSE^s. Number of Sensors, Givm 30 Beacons andr=8 
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FIG.4 NODE DENSITY INCREASING COMPARISON OF HC, 
DVAND DHL WITH R=8 

each beacon node as 0i , for i=l, . . ., k, using either the 
DHL or the DV method. The error of the hop-distance 
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estimate °' is given by 

£ 0l = d 0i - yjix, ~x Q ) 2 +( yi -y ) 2 

If we set the error to zero and rearranged the terms, it 
will has 

(4 + yl ) - 2x o x i - 2 y^i = - x ? - yf (14) 

for i=l,..., k. By subtracting each equation with the 
equation corresponding to i=k, we get rid of the 
quadratic terms and yield the following equations: 

2x Q (x k -x,) + 2y Q (y k -y,) = d 2 m - d 2 k 

2 2,2,2 

-x x -y 1 +x k +y k 
2x (x k -x k _ l ) + 2y (y k -y k _ l ) = dl k _ 1 -d 2 k 

2 2,2,2 

- - y k -i +x k+y k 

(15) 

We can rewrite the equations in the form Ax=b where 



A = 



2 ( X k 


2 (y k - 


yd ~ 




2 ( x k 


~ x i) 2(y k - 


y 2 ) 




2 ( x k 


-*k-i) 2 (yk - 


y k -i)_ 




d 2 - 

01 


A 2 2 2 

a ok x \ y\ 


2 2 


w 02 


d 2 -x 2 -v 2 

u 0k A 2 J2 


2 2 

+ x k+y k 


a 0,k-l 


— d 0k — x k _ x — 


2,2,2 

y k -i +x k +y k 



and x=[xO, yO]T. The location of the target sensor is 
then obtained by means of solving the least squares 
problem 

minlAx - b| 2 < x Q < D and < y Q < D 
x 1 1 ' (17) 

In Fig.5, it compares the performance of the 
triangulation-based localization with the schemes 
employing distributed learning, It shows that, as the 
number of beacon nodes increase, distributed learning 
outperforms the schemes based on triangulation. This 
is because the triangulation method relies largely on 
the accuracy of the distance estimate obtained with 
DHL and DV. The estimation accuracy is affected by 
many factors such as node density, transmission range, 
non-uniformity etc. On the other hand, with 
distributed learning, each beacon node matches the 
features or characteristics of the non-beacon node to its 



own in order to compute the local decision. 
Conclusion 

In this paper, several distributed learning approaces 
are proposed to localization in wireless snesor 
networks with only hop-count information at each 
node. In the system, it is assumed that the existence of 
a number of beacons that have perfect knowledge of 
their own coordinates exist and that used their 
knowledge as training data for distributed learing on 
the target sensor's location. A series of binary 
classification problems were solved to determine 
which area the target sensors falls into. The features 
that were used to perform the classification were the 
hop-counts or the hop-distances estimated via the 
DHL or DV methods. The three approaches, namely 
HC, DHL, and DV method, were compared under 
various system parameters. The proposed schemes 
were also compared to the often employed 
triangulation method. The simulation results show that 
the localization methods based on the distributed 
learning is more accurate and more effectual. 
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